PATENT CLAIMS 

claim: 

1. A computer implemented method of analyzing electronic data comprising the steps of:
a) providing a processing unit capable of receiving electronic data;
b) further providing a storage device coupled to said processing unit;
c) accessing one or more electronic data files, each said data file having a structure;
d) analyzing said one or more electronic data files to identify record break information contained therein;
e) utilizing said record break information, parsing said one or more data files into one or more electronic data records;
f) analyzing each of said electronic data records to identify field break information contained therein;
g) utilizing said field break information, parsing each of said data records into one or more data fields; and,
h) generating output data describing said structure of said one or more electronic data files.
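
By way of non-limiting illustration only, and forming no part of any claim, steps d) through h) might be sketched as follows. The sketch assumes that record breaks are line terminators and that field breaks are one of a small set of candidate delimiter characters; the function names, the candidate list, and the shape of the output dictionary are all hypothetical and are not taken from the claims.

```python
from collections import Counter

# Assumed candidate field delimiters; the claims do not enumerate any.
CANDIDATE_DELIMITERS = [",", "\t", "|", ";"]


def find_record_break(text: str) -> str:
    """Step d): choose a line terminator present in the data (assumption)."""
    for terminator in ("\r\n", "\n", "\r"):
        if terminator in text:
            return terminator
    return "\n"


def find_field_break(records: list) -> str:
    """Step f): choose the candidate delimiter found in the most records."""
    counts = Counter()
    for record in records:
        for delimiter in CANDIDATE_DELIMITERS:
            if delimiter in record:
                counts[delimiter] += 1
    return counts.most_common(1)[0][0] if counts else ","


def describe_structure(text: str) -> dict:
    """Steps d) through h) applied to the contents of one data file."""
    record_break = find_record_break(text)
    # Step e): parse the file into records, dropping empty trailing pieces.
    records = [r for r in text.split(record_break) if r]
    field_break = find_field_break(records)
    # Step g): parse each record into fields.
    parsed = [r.split(field_break) for r in records]
    # Step h): output data describing the structure of the file.
    return {
        "record_break": record_break,
        "field_break": field_break,
        "record_count": len(parsed),
        "fields_per_record": [len(fields) for fields in parsed],
    }
```

Under these assumptions, calling `describe_structure` on the text `"a,b,c\r\n1,2,3\r\n"` would report a `"\r\n"` record break, a `","` field break, and two records of three fields each.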

2. The method of claim 1, further comprising the steps of:
repeating steps d) through g); and
utilizing said record break information and said field break information, updating said output data.

3. The method of claim 1 or 2, further comprising the step of:
storing said output data within said storage device.

4. The method of claim 1, further comprising the step of:
assigning a tokenized symbolic identifier to one or more of said data fields.

5. The method of claim 1, further comprising the step of:
providing a user interface through which a user may modify said output data, said user interface coupled to said storage device.

6. The method of claim 1, further comprising the step of:
utilizing said output data, generating a translation document capable of translating electronic documents into one or more predefined formats.

7. The method of claim 1, further comprising the steps of:
receiving modification instructions;
applying said modification instructions to one or more of said data fields; and
generating a first plurality of data files containing one or more modified data fields.

8. The method of claim 7, further comprising the step of:
testing said first plurality of data files.

9. The method of claim 1, further comprising the step of:
identifying a file type associated with each of said electronic data files.

10. The method of claim 1, further comprising the step of:
combining substantially similar electronic data files.

11. The method of claim 1, further comprising the steps of:
identifying one or more types of said electronic data records; and
analyzing said record type of each of said electronic data records to determine a degree of similarity.

12. The method of claim 11, further comprising the step of:
determining a cardinality for each said record type.

13. The method of claim 11, further comprising the step of:
determining a sequence of representation for each said record type.

14. The method of claim 11, further comprising the step of:
representing said degree of similarity of each said record type within said output data.

15. The method of claim 12, further comprising the step of:
representing said cardinality of each said record type within said data file.

16. The method of claim 13, further comprising the step of:
representing said sequence of representation for each said record type within said data file.

17. A computer readable medium comprising a plurality of instructions for analyzing computer intelligible electronic data which, when read by a computer system having a processing unit capable of receiving electronic data coupled to a storage device capable of storing electronic data, causes the computer to perform the steps of:
a) accessing one or more electronic data files, each said data file having a structure;
b) analyzing said one or more electronic data files to identify record break information contained therein;
c) utilizing said record break information, parsing said one or more data files into one or more electronic data records;
d) analyzing each of said electronic data records to identify field break information contained therein;
e) utilizing said field break information, parsing each of said data records into one or more data fields;
f) generating output data describing said structure of said one or more electronic data files; and,



18. The medium of claim 17, wherein said plurality of instructions causes the computer to perform the additional steps of:
repeating steps b) through e);
utilizing said record break information and said field break information, updating said output data.

19. The medium of claim 17 or 18, further comprising the step of:
storing said output data within said storage device.

20. The medium of claim 17, wherein said plurality of instructions causes the computer to perform the additional step of:
assigning a tokenized symbolic identifier to one or more of said data fields.

21. The medium of claim 17, wherein said plurality of instructions causes the computer to perform the additional step of:
providing a user interface through which a user may modify said output data, said user interface coupled to said storage device.

22. The medium of claim 17, wherein said plurality of instructions causes the computer to perform the additional step of:
utilizing said output data, generating a translation document capable of translating electronic documents into one or more predefined formats.

23. The medium of claim 17, wherein said plurality of instructions causes the computer to perform the additional steps of:
receiving modification instructions;
applying said modification instructions to one or more of said data fields; and
generating a first plurality of data files containing one or more modified data fields.

24. The medium of claim 23, wherein said plurality of instructions causes the computer to perform the additional step of:
testing said first plurality of data files.

25. The medium of claim 17, wherein said plurality of instructions causes the computer to perform the additional step of:
identifying a file type associated with each of said electronic data files.

26. The medium of claim 17, wherein said plurality of instructions causes the computer to perform the additional step of:
combining substantially similar electronic data files.

27. The medium of claim 17, further comprising the steps of:
identifying one or more types of said electronic data records; and
analyzing said record type of each of said electronic data records to determine a degree of similarity.

28. The medium of claim 27, further comprising the step of:
determining a cardinality for each said record type.

29. The method of claim 27, further comprising the step of:
determining a sequence of representation for each said record type.

30. The method of claim 27, further comprising the step of:
representing said degree of similarity of each said record type within said output data.

31. The method of claim 28, further comprising the step of:
representing said cardinality of each said record type within said data file.

32. The method of claim 29, further comprising the step of:
representing said sequence of representation for each said record type within said data file.

33. A computer system for analyzing computer intelligible electronic data comprising:
a processing unit for accessing one or more electronic data files, each said data file having a structure, for analyzing said one or more electronic data files to identify record break information contained within said files, for parsing said one or more data files into one or more electronic data records according to said record break information, for analyzing each of said electronic data records to identify field break information contained within said records, for parsing each of said data records into one or more data fields according to said field break information and for generating output data describing said structure of said one or more electronic data files.

34. The computer system of claim 33, wherein said processing unit is further defined as being capable of updating said output data.

35. The computer system of claim 33, wherein said computer system further comprises a storage device, said processing unit being capable of storing said output data within said storage device.

36. The computer system of claim 33, wherein said processing unit is further defined as being capable of assigning a tokenized symbolic identifier to one or more of said electronic data fields.

37. The computer system of claim 33, wherein said computer system further comprises an interface through which a user may modify said output data, said interface being coupled to said processing unit.

38. The computer system of claim 33, wherein said processing unit is further defined as being capable of generating a translation document capable of translating electronic data into one or more predefined formats.

39. The computer system of claim 33, wherein said processing unit is further defined as being capable of receiving modification instructions, applying said modification instructions to one or more of said data fields and generating a first plurality of data files containing one or more modified data fields.

40. The computer system of claim 39, wherein said processing unit is further defined as being capable of testing said first plurality of data files.

41. The computer system of claim 33, wherein said processing unit is further defined as being capable of identifying a file type associated with each of said electronic data files.

42. The computer system of claim 33, wherein said processing unit is further defined as being capable of combining a first plurality of said electronic data files having a substantially similar structure.

43. The computer system of claim 33, wherein said record break information comprises one or more line termination characters.

44. The computer system of claim 33, wherein said record break information comprises one or more record break characters.

45. The computer system of claim 33, wherein said field break information comprises one or more character type transitions.

46. The computer system of claim 33, wherein said field break information comprises one or more character counts.
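
By way of non-limiting illustration only, and forming no part of any claim, the two kinds of field break information recited in claims 45 and 46 might be sketched as follows. The sketch assumes four character types (digit, alphabetic, whitespace, other) and a caller-supplied list of field widths; these categories, the function names, and the sample records are hypothetical and are not taken from the claims.

```python
def char_type(ch: str) -> str:
    """Classify one character into an assumed set of character types."""
    if ch.isdigit():
        return "digit"
    if ch.isalpha():
        return "alpha"
    if ch.isspace():
        return "space"
    return "other"


def split_on_type_transitions(record: str) -> list:
    """Break a record wherever the character type changes (cf. claim 45)."""
    fields = []
    start = 0
    for i in range(1, len(record)):
        if char_type(record[i]) != char_type(record[i - 1]):
            fields.append(record[start:i])
            start = i
    if record:
        fields.append(record[start:])
    return fields


def split_on_character_counts(record: str, widths: list) -> list:
    """Break a record into fixed-width fields by character count (cf. claim 46)."""
    fields = []
    position = 0
    for width in widths:
        fields.append(record[position:position + width])
        position += width
    return fields
```

Under these assumptions, `split_on_type_transitions("AB123CD")` yields the fields `AB`, `123` and `CD`, while `split_on_character_counts("2024ACME0042", [4, 4, 4])` yields `2024`, `ACME` and `0042`.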

47. The computer system of claim 33, wherein said processing unit is further defined as being capable of identifying one or more types of said electronic data records and analyzing said types of said electronic data records to determine a degree of similarity.

48. The computer system of claim 47, wherein said processing unit is further defined as being capable of determining a cardinality for each said record type.

49. The computer system of claim 47, wherein said processing unit is further defined as being capable of determining a sequence of representation for each said record type.

50. The computer system of claim 47, wherein said processing unit is further defined as being capable of representing said degree of similarity of each said record type within said output data.

51. The computer system of claim 48, wherein said processing unit is further defined as being capable of representing said cardinality of each said record type within said data file.



52. The computer system of claim 49, wherein said processing unit is further defined as being capable of representing said sequence of representation for each said record type within said data file.